Overview

Dataset statistics

Number of variables: 21
Number of observations: 1541
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 171.0 KiB
Average record size in memory: 113.6 B
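
The average record size appears to be the total in-memory size divided by the number of observations. The sketch below checks that arithmetic using the figures reported above; the function names are illustrative and are not part of pandas-profiling, and the conversion assumes 1 KiB = 1024 bytes.

```python
# Figures copied from the "Dataset statistics" block of the report.
TOTAL_SIZE_KIB = 171.0
N_OBSERVATIONS = 1541
N_VARIABLES = 21
MISSING_CELLS = 0


def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, assuming 1 KiB = 1024 bytes."""
    return total_kib * 1024 / n_rows


def missing_cell_share(missing: int, n_rows: int, n_cols: int) -> float:
    """Percentage of missing cells among all rows x columns cells."""
    return 100.0 * missing / (n_rows * n_cols)


avg_bytes = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
missing_pct = missing_cell_share(MISSING_CELLS, N_OBSERVATIONS, N_VARIABLES)
print(f"Average record size: {avg_bytes:.1f} B")
print(f"Missing cells: {missing_pct:.1f}%")
```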

Variable types

Numeric: 12
Categorical: 9

Warnings

iclevel has constant value "Four or more years" (Constant)
instnm has a high cardinality: 1528 distinct values (High cardinality)
stabbr has a high cardinality: 54 distinct values (High cardinality)
admssn is highly correlated with enrlft and 1 other field (High correlation)
enrlft is highly correlated with admssn and 1 other field (High correlation)
enrlt is highly correlated with admssn and 1 other field (High correlation)
locale is highly correlated with iclevel (High correlation)
instsize is highly correlated with iclevel (High correlation)
stabbr is highly correlated with iclevel (High correlation)
instcat is highly correlated with iclevel (High correlation)
c15basic is highly correlated with iclevel (High correlation)
iclevel is highly correlated with locale and 6 other fields (High correlation)
sector is highly correlated with iclevel and 1 other field (High correlation)
control is highly correlated with iclevel and 1 other field (High correlation)
instnm is uniformly distributed (Uniform)
unitid has unique values (Unique)
latitude has unique values (Unique)
roomamt has 309 (20.1%) zeros (Zeros)
boardamt has 376 (24.4%) zeros (Zeros)
applfeeu has 446 (28.9%) zeros (Zeros)
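
The Zeros percentages in the warnings follow from the zero counts and the 1541 observations. The sketch below reproduces them, together with simple checks in the spirit of the Constant and Unique warnings. The helper names are illustrative assumptions; the thresholds that pandas-profiling uses to decide when to raise a warning are not shown in this report and are not modelled here.

```python
N_OBSERVATIONS = 1541  # "Number of observations" in the overview

# Zero counts as listed in the Warnings section.
ZERO_COUNTS = {"roomamt": 309, "boardamt": 376, "applfeeu": 446}


def zero_share(n_zeros: int, n_rows: int) -> float:
    """Percentage of rows whose value is zero."""
    return 100.0 * n_zeros / n_rows


def is_constant(values) -> bool:
    """True when every value in the column is the same."""
    return len(set(values)) == 1


def is_unique(values) -> bool:
    """True when no value occurs more than once."""
    values = list(values)
    return len(set(values)) == len(values)


for name, zeros in ZERO_COUNTS.items():
    print(f"{name}: {zero_share(zeros, N_OBSERVATIONS):.1f}% zeros")
```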

Reproduction

Analysis started: 2021-02-21 22:02:13.230581
Analysis finished: 2021-02-21 22:02:27.195658
Duration: 13.97 seconds
Software version: pandas-profiling v2.11.0
Download configuration: config.yaml
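
As a quick consistency check, the reported duration can be recomputed from the two timestamps above. This is a minimal sketch that assumes both timestamps are in the same time zone.

```python
from datetime import datetime

# Timestamp layout used in the Reproduction block.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2021-02-21 22:02:13.230581", FMT)
finished = datetime.strptime("2021-02-21 22:02:27.195658", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```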

Variables

unitid
Real number (ℝ≥0)

UNIQUE

Distinct: 1541
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 193292.6398
Minimum: 100654
Maximum: 491057
Zeros: 0
Zeros (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 100654
5-th percentile: 110714
Q1: 154235
median: 188030
Q3: 215655
95-th percentile: 243443
Maximum: 491057
Range: 390403
Interquartile range (IQR): 61420

Descriptive statistics

Standard deviation: 67893.94567
Coefficient of variation (CV): 0.3512495133
Kurtosis: 8.091105326
Mean: 193292.6398
Median Absolute Deviation (MAD): 30273
Skewness: 2.483451338
Sum: 297863958
Variance: 4609587858
Monotonicity: Strictly increasing
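
Several of the statistics above are derived from others: mean = sum / n, range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, and variance = standard deviation squared. The sketch below recomputes them from the reported unitid values and compares the results within a small relative tolerance. It only checks these identities against the printed figures; it makes no claim about how the library computes them internally.

```python
import math

# Values copied from the unitid section of the report.
stats = {
    "n": 1541, "sum": 297863958, "mean": 193292.6398,
    "min": 100654, "max": 491057, "q1": 154235, "q3": 215655,
    "std": 67893.94567, "variance": 4609587858,
    "cv": 0.3512495133, "range": 390403, "iqr": 61420,
}

# Each derived statistic recomputed from the more basic ones.
derived = {
    "mean": stats["sum"] / stats["n"],
    "range": stats["max"] - stats["min"],
    "iqr": stats["q3"] - stats["q1"],
    "cv": stats["std"] / stats["mean"],
    "variance": stats["std"] ** 2,
}

for name, value in derived.items():
    ok = math.isclose(value, stats[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.10g} reported={stats[name]} match={ok}")
```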
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
169983 | 1 | 0.1%
174844 | 1 | 0.1%
189088 | 1 | 0.1%
123554 | 1 | 0.1%
148131 | 1 | 0.1%
139940 | 1 | 0.1%
127653 | 1 | 0.1%
233426 | 1 | 0.1%
189097 | 1 | 0.1%
195243 | 1 | 0.1%
Other values (1531) | 1531 | 99.4%

Value | Count | Frequency (%)
100654 | 1 | 0.1%
100663 | 1 | 0.1%
100706 | 1 | 0.1%
100724 | 1 | 0.1%
100751 | 1 | 0.1%

Value | Count | Frequency (%)
491057 | 1 | 0.1%
490805 | 1 | 0.1%
490513 | 1 | 0.1%
490504 | 1 | 0.1%
490319 | 1 | 0.1%

instnm
Categorical

HIGH CARDINALITY
UNIFORM

Distinct: 1528
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

Westminster College | 3
Union College | 3
Emmanuel College | 2
St. John's College | 2
Wheaton College | 2
Other values (1523) | 1529

Length

Max length: 75
Median length: 24
Mean length: 25.25697599
Min length: 6

Characters and Unicode

Total characters: 38921
Distinct characters: 60
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
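
As an illustration of the character properties mentioned above, the sketch below tallies Unicode general categories for one institution name from this report using Python's standard `unicodedata` module. The mapping from two-letter category codes to the long names shown in the report is written out by hand and covers only the categories that appear in this document. The standard library exposes the general category but not the script or block property, so those are left out of the sketch.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Sm": "Math Symbol",
    "Sk": "Modifier Symbol",
    "Nd": "Decimal Number",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


counts = category_counts(["St. John's College"])
for name, n in counts.most_common():
    print(f"{name} | {n}")
```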

Unique

Unique: 1517
Unique (%): 98.4%
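
The figures in this section suggest that "Distinct" counts the different values (1528) while "Unique" counts the values that occur exactly once (1517), which leaves 1528 - 1517 = 11 names shared by more than one institution. The sketch below shows that distinction on a small hypothetical list; the function name and the sample rows are illustrative only.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical sample: two repeated names and two names that occur once.
names = [
    "Union College", "Union College", "Union College",
    "Wheaton College", "Wheaton College",
    "Alabama State University",
    "The University of Alabama",
]

distinct, unique = distinct_and_unique(names)
print(f"Distinct: {distinct}, Unique: {unique}")
```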

Sample

1st row: Alabama A & M University
2nd row: University of Alabama at Birmingham
3rd row: University of Alabama in Huntsville
4th row: Alabama State University
5th row: The University of Alabama
Value | Count | Frequency (%)
Westminster College | 3 | 0.2%
Union College | 3 | 0.2%
Emmanuel College | 2 | 0.1%
St. John's College | 2 | 0.1%
Wheaton College | 2 | 0.1%
Bethany College | 2 | 0.1%
Sterling College | 2 | 0.1%
Marian University | 2 | 0.1%
Bethel University | 2 | 0.1%
Anderson University | 2 | 0.1%
Other values (1518) | 1519 | 98.6%
Histogram of lengths of the category
Value | Count | Frequency (%)
university | 846 | 17.1%
college | 478 | 9.7%
of | 373 | 7.5%
state | 193 | 3.9%
the | 64 | 1.3%
at | 50 | 1.0%
saint | 39 | 0.8%
institute | 38 | 0.8%
and | 36 | 0.7%
new | 32 | 0.6%
Other values (1385) | 2799 | 56.6%
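
The word table above looks like a count of lowercased tokens across all institution names. The sketch below shows one way such a table could be produced, assuming a plain whitespace split and lowercasing; the tokenisation rules that pandas-profiling actually applies are not stated in this report. The input is the five sample rows listed earlier in this section.

```python
from collections import Counter


def word_counts(values):
    """Count lowercased, whitespace-separated tokens across all strings."""
    counts = Counter()
    for text in values:
        counts.update(text.lower().split())
    return counts


# The five sample rows shown for instnm.
sample_rows = [
    "Alabama A & M University",
    "University of Alabama at Birmingham",
    "University of Alabama in Huntsville",
    "Alabama State University",
    "The University of Alabama",
]

counts = word_counts(sample_rows)
total = sum(counts.values())
for word, n in counts.most_common(3):
    print(f"{word} | {n} | {100 * n / total:.1f}%")
```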

Most occurring characters

Value | Count | Frequency (%)
e | 3940 | 10.1%
i | 3502 | 9.0%
(space) | 3408 | 8.8%
n | 2707 | 7.0%
t | 2644 | 6.8%
a | 2262 | 5.8%
r | 2244 | 5.8%
o | 2212 | 5.7%
s | 2046 | 5.3%
l | 1991 | 5.1%
Other values (50) | 11965 | 30.7%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 30502 | 78.4%
Uppercase Letter | 4737 | 12.2%
Space Separator | 3408 | 8.8%
Dash Punctuation | 212 | 0.5%
Other Punctuation | 59 | 0.2%
Open Punctuation | 1 | < 0.1%
Math Symbol | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%

Most frequent character per category

ValueCountFrequency (%)
U997
21.0%
C837
17.7%
S516
10.9%
M330
 
7.0%
A210
 
4.4%
T178
 
3.8%
B164
 
3.5%
N164
 
3.5%
W156
 
3.3%
P155
 
3.3%
Other values (16)1030
21.7%
ValueCountFrequency (%)
e3940
12.9%
i3502
11.5%
n2707
8.9%
t2644
8.7%
a2262
 
7.4%
r2244
 
7.4%
o2212
 
7.3%
s2046
 
6.7%
l1991
 
6.5%
y1293
 
4.2%
Other values (16)5661
18.6%
ValueCountFrequency (%)
&26
44.1%
'26
44.1%
.7
 
11.9%
ValueCountFrequency (%)
3408
100.0%
ValueCountFrequency (%)
-212
100.0%
ValueCountFrequency (%)
(1
100.0%
ValueCountFrequency (%)
+1
100.0%
ValueCountFrequency (%)
)1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin35239
90.5%
Common3682
 
9.5%

Most frequent character per script

ValueCountFrequency (%)
e3940
 
11.2%
i3502
 
9.9%
n2707
 
7.7%
t2644
 
7.5%
a2262
 
6.4%
r2244
 
6.4%
o2212
 
6.3%
s2046
 
5.8%
l1991
 
5.6%
y1293
 
3.7%
Other values (42)10398
29.5%
ValueCountFrequency (%)
3408
92.6%
-212
 
5.8%
&26
 
0.7%
'26
 
0.7%
.7
 
0.2%
(1
 
< 0.1%
+1
 
< 0.1%
)1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII38921
100.0%

Most frequent character per block

ValueCountFrequency (%)
e3940
 
10.1%
i3502
 
9.0%
3408
 
8.8%
n2707
 
7.0%
t2644
 
6.8%
a2262
 
5.8%
r2244
 
5.8%
o2212
 
5.7%
s2046
 
5.3%
l1991
 
5.1%
Other values (50)11965
30.7%

stabbr
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct: 54
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

NY | 146
PA | 112
CA | 93
MA | 68
TX | 67
Other values (49) | 1055

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 3082
Distinct characters: 24
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3
Unique (%): 0.2%

Sample

1st row: AL
2nd row: AL
3rd row: AL
4th row: AL
5th row: AL
Value | Count | Frequency (%)
NY | 146 | 9.5%
PA | 112 | 7.3%
CA | 93 | 6.0%
MA | 68 | 4.4%
TX | 67 | 4.3%
OH | 62 | 4.0%
IL | 58 | 3.8%
NC | 55 | 3.6%
GA | 46 | 3.0%
FL | 44 | 2.9%
Other values (44) | 790 | 51.3%
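
The table above lists the ten most frequent values and folds the remaining 44 states into a single "Other values" row; each percentage is the count divided by the 1541 observations (for example, 146 / 1541 is about 9.5%). The sketch below builds a table of the same shape from a small hypothetical sample. The function name `top_n_with_other` is an illustrative assumption, not a pandas-profiling API.

```python
from collections import Counter


def top_n_with_other(counts: Counter, n: int = 10):
    """Return (label, count, percent) rows for the n most common values,
    plus one roll-up row for everything else."""
    total = sum(counts.values())
    top = counts.most_common(n)
    rows = [(value, c, 100.0 * c / total) for value, c in top]
    remaining_values = len(counts) - len(top)
    if remaining_values > 0:
        remaining_count = total - sum(c for _, c in top)
        rows.append((f"Other values ({remaining_values})",
                     remaining_count,
                     100.0 * remaining_count / total))
    return rows


# Hypothetical sample of state abbreviations (not the real column).
states = ["NY"] * 4 + ["PA"] * 3 + ["CA"] * 2 + ["AL"]
rows = top_n_with_other(Counter(states), n=2)
for label, count, pct in rows:
    print(f"{label} | {count} | {pct:.1f}%")
```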
Histogram of lengths of the category
ValueCountFrequency (%)
ny146
 
9.5%
pa112
 
7.3%
ca93
 
6.0%
ma68
 
4.4%
tx67
 
4.3%
oh62
 
4.0%
il58
 
3.8%
nc55
 
3.6%
ga46
 
3.0%
fl44
 
2.9%
Other values (44)790
51.3%

Most occurring characters

ValueCountFrequency (%)
A481
15.6%
N399
12.9%
M259
 
8.4%
I224
 
7.3%
C219
 
7.1%
Y174
 
5.6%
O162
 
5.3%
T155
 
5.0%
L145
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3082
100.0%

Most frequent character per category

ValueCountFrequency (%)
A481
15.6%
N399
12.9%
M259
 
8.4%
I224
 
7.3%
C219
 
7.1%
Y174
 
5.6%
O162
 
5.3%
T155
 
5.0%
L145
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

Most occurring scripts

ValueCountFrequency (%)
Latin3082
100.0%

Most frequent character per script

ValueCountFrequency (%)
A481
15.6%
N399
12.9%
M259
 
8.4%
I224
 
7.3%
C219
 
7.1%
Y174
 
5.6%
O162
 
5.3%
T155
 
5.0%
L145
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII3082
100.0%

Most frequent character per block

ValueCountFrequency (%)
A481
15.6%
N399
12.9%
M259
 
8.4%
I224
 
7.3%
C219
 
7.1%
Y174
 
5.6%
O162
 
5.3%
T155
 
5.0%
L145
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

sector
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Private not-for-profit, 4-year or above | 1033
Public, 4-year or above | 508

Length

Max length: 39
Median length: 39
Mean length: 33.72550292
Min length: 23

Characters and Unicode

Total characters: 51971
Distinct characters: 20
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Public, 4-year or above
2nd row: Public, 4-year or above
3rd row: Public, 4-year or above
4th row: Public, 4-year or above
5th row: Public, 4-year or above

Value | Count | Frequency (%)
Private not-for-profit, 4-year or above | 1033 | 67.0%
Public, 4-year or above | 508 | 33.0%
Histogram of lengths of the category
ValueCountFrequency (%)
above1541
21.4%
or1541
21.4%
4-year1541
21.4%
not-for-profit1033
14.4%
private1033
14.4%
public508
 
7.1%

Most occurring characters

ValueCountFrequency (%)
r6181
11.9%
o6181
11.9%
5656
10.9%
e4115
 
7.9%
a4115
 
7.9%
-3607
 
6.9%
t3099
 
6.0%
i2574
 
5.0%
v2574
 
5.0%
f2066
 
4.0%
Other values (10)11803
22.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter38085
73.3%
Space Separator5656
 
10.9%
Dash Punctuation3607
 
6.9%
Uppercase Letter1541
 
3.0%
Other Punctuation1541
 
3.0%
Decimal Number1541
 
3.0%

Most frequent character per category

ValueCountFrequency (%)
r6181
16.2%
o6181
16.2%
e4115
10.8%
a4115
10.8%
t3099
8.1%
i2574
6.8%
v2574
6.8%
f2066
 
5.4%
b2049
 
5.4%
y1541
 
4.0%
Other values (5)3590
9.4%
ValueCountFrequency (%)
P1541
100.0%
ValueCountFrequency (%)
,1541
100.0%
ValueCountFrequency (%)
5656
100.0%
ValueCountFrequency (%)
41541
100.0%
ValueCountFrequency (%)
-3607
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin39626
76.2%
Common12345
 
23.8%

Most frequent character per script

ValueCountFrequency (%)
r6181
15.6%
o6181
15.6%
e4115
10.4%
a4115
10.4%
t3099
7.8%
i2574
6.5%
v2574
6.5%
f2066
 
5.2%
b2049
 
5.2%
P1541
 
3.9%
Other values (6)5131
12.9%
ValueCountFrequency (%)
5656
45.8%
-3607
29.2%
,1541
 
12.5%
41541
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII51971
100.0%

Most frequent character per block

ValueCountFrequency (%)
r6181
11.9%
o6181
11.9%
5656
10.9%
e4115
 
7.9%
a4115
 
7.9%
-3607
 
6.9%
t3099
 
6.0%
i2574
 
5.0%
v2574
 
5.0%
f2066
 
4.0%
Other values (10)11803
22.7%

iclevel
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Four or more years | 1541

Length

Max length: 18
Median length: 18
Mean length: 18
Min length: 18

Characters and Unicode

Total characters: 27738
Distinct characters: 10
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Four or more years
2nd row: Four or more years
3rd row: Four or more years
4th row: Four or more years
5th row: Four or more years

Value | Count | Frequency (%)
Four or more years | 1541 | 100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
four1541
25.0%
years1541
25.0%
more1541
25.0%
or1541
25.0%

Most occurring characters

ValueCountFrequency (%)
r6164
22.2%
o4623
16.7%
4623
16.7%
e3082
11.1%
F1541
 
5.6%
u1541
 
5.6%
m1541
 
5.6%
y1541
 
5.6%
a1541
 
5.6%
s1541
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter21574
77.8%
Space Separator4623
 
16.7%
Uppercase Letter1541
 
5.6%

Most frequent character per category

ValueCountFrequency (%)
r6164
28.6%
o4623
21.4%
e3082
14.3%
u1541
 
7.1%
m1541
 
7.1%
y1541
 
7.1%
a1541
 
7.1%
s1541
 
7.1%
ValueCountFrequency (%)
F1541
100.0%
ValueCountFrequency (%)
4623
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin23115
83.3%
Common4623
 
16.7%

Most frequent character per script

ValueCountFrequency (%)
r6164
26.7%
o4623
20.0%
e3082
13.3%
F1541
 
6.7%
u1541
 
6.7%
m1541
 
6.7%
y1541
 
6.7%
a1541
 
6.7%
s1541
 
6.7%
ValueCountFrequency (%)
4623
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII27738
100.0%

Most frequent character per block

ValueCountFrequency (%)
r6164
22.2%
o4623
16.7%
4623
16.7%
e3082
11.1%
F1541
 
5.6%
u1541
 
5.6%
m1541
 
5.6%
y1541
 
5.6%
a1541
 
5.6%
s1541
 
5.6%

control
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Private not-for-profit | 1033
Public | 508

Length

Max length: 22
Median length: 22
Mean length: 16.72550292
Min length: 6

Characters and Unicode

Total characters: 25774
Distinct characters: 17
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Public
2nd row: Public
3rd row: Public
4th row: Public
5th row: Public

Value | Count | Frequency (%)
Private not-for-profit | 1033 | 67.0%
Public | 508 | 33.0%
Histogram of lengths of the category
ValueCountFrequency (%)
not-for-profit1033
40.1%
private1033
40.1%
public508
19.7%

Most occurring characters

ValueCountFrequency (%)
r3099
12.0%
t3099
12.0%
o3099
12.0%
i2574
10.0%
-2066
8.0%
f2066
8.0%
P1541
 
6.0%
v1033
 
4.0%
a1033
 
4.0%
e1033
 
4.0%
Other values (7)5131
19.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter21134
82.0%
Dash Punctuation2066
 
8.0%
Uppercase Letter1541
 
6.0%
Space Separator1033
 
4.0%

Most frequent character per category

ValueCountFrequency (%)
r3099
14.7%
t3099
14.7%
o3099
14.7%
i2574
12.2%
f2066
9.8%
v1033
 
4.9%
a1033
 
4.9%
e1033
 
4.9%
n1033
 
4.9%
p1033
 
4.9%
Other values (4)2032
9.6%
ValueCountFrequency (%)
P1541
100.0%
ValueCountFrequency (%)
1033
100.0%
ValueCountFrequency (%)
-2066
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin22675
88.0%
Common3099
 
12.0%

Most frequent character per script

ValueCountFrequency (%)
r3099
13.7%
t3099
13.7%
o3099
13.7%
i2574
11.4%
f2066
9.1%
P1541
6.8%
v1033
 
4.6%
a1033
 
4.6%
e1033
 
4.6%
n1033
 
4.6%
Other values (5)3065
13.5%
ValueCountFrequency (%)
-2066
66.7%
1033
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII25774
100.0%

Most frequent character per block

ValueCountFrequency (%)
r3099
12.0%
t3099
12.0%
o3099
12.0%
i2574
10.0%
-2066
8.0%
f2066
8.0%
P1541
 
6.0%
v1033
 
4.0%
a1033
 
4.0%
e1033
 
4.0%
Other values (7)5131
19.9%

locale
Categorical

HIGH CORRELATION

Distinct: 12
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

City: Large | 342
Suburb: Large | 301
City: Small | 209
Town: Distant | 179
City: Midsize | 172
Other values (7) | 338

Length

Max length: 15
Median length: 13
Mean length: 12.25892278
Min length: 11

Characters and Unicode

Total characters: 18891
Distinct characters: 27
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: City: Midsize
2nd row: City: Midsize
3rd row: City: Midsize
4th row: City: Midsize
5th row: City: Small

Value | Count | Frequency (%)
City: Large | 342 | 22.2%
Suburb: Large | 301 | 19.5%
City: Small | 209 | 13.6%
Town: Distant | 179 | 11.6%
City: Midsize | 172 | 11.2%
Town: Remote | 109 | 7.1%
Town: Fringe | 62 | 4.0%
Suburb: Midsize | 51 | 3.3%
Rural: Fringe | 40 | 2.6%
Suburb: Small | 31 | 2.0%
Other values (2) | 45 | 2.9%
Histogram of lengths of the category
ValueCountFrequency (%)
city723
23.5%
large643
20.9%
suburb383
12.4%
town350
11.4%
small240
 
7.8%
midsize223
 
7.2%
distant208
 
6.7%
remote125
 
4.1%
fringe102
 
3.3%
rural85
 
2.8%

Most occurring characters

ValueCountFrequency (%)
:1541
 
8.2%
1541
 
8.2%
i1479
 
7.8%
t1264
 
6.7%
e1218
 
6.4%
r1213
 
6.4%
a1176
 
6.2%
u851
 
4.5%
b766
 
4.1%
g745
 
3.9%
Other values (17)7097
37.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter12727
67.4%
Uppercase Letter3082
 
16.3%
Other Punctuation1541
 
8.2%
Space Separator1541
 
8.2%

Most frequent character per category

ValueCountFrequency (%)
i1479
11.6%
t1264
9.9%
e1218
9.6%
r1213
9.5%
a1176
9.2%
u851
 
6.7%
b766
 
6.0%
g745
 
5.9%
y723
 
5.7%
n660
 
5.2%
Other values (7)2632
20.7%
ValueCountFrequency (%)
C723
23.5%
L643
20.9%
S623
20.2%
T350
11.4%
M223
 
7.2%
R210
 
6.8%
D208
 
6.7%
F102
 
3.3%
ValueCountFrequency (%)
:1541
100.0%
ValueCountFrequency (%)
1541
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin15809
83.7%
Common3082
 
16.3%

Most frequent character per script

ValueCountFrequency (%)
i1479
 
9.4%
t1264
 
8.0%
e1218
 
7.7%
r1213
 
7.7%
a1176
 
7.4%
u851
 
5.4%
b766
 
4.8%
g745
 
4.7%
C723
 
4.6%
y723
 
4.6%
Other values (15)5651
35.7%
ValueCountFrequency (%)
:1541
50.0%
1541
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII18891
100.0%

Most frequent character per block

ValueCountFrequency (%)
:1541
 
8.2%
1541
 
8.2%
i1479
 
7.8%
t1264
 
6.7%
e1218
 
6.4%
r1213
 
6.4%
a1176
 
6.2%
u851
 
4.5%
b766
 
4.1%
g745
 
3.9%
Other values (17)7097
37.6%

instcat
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Degree-granting, primarily baccalaureate or above | 1504
Degree-granting, not primarily baccalaureate or above | 37

Length

Max length: 53
Median length: 49
Mean length: 49.09604153
Min length: 49

Characters and Unicode

Total characters: 75657
Distinct characters: 20
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Degree-granting, primarily baccalaureate or above
2nd row: Degree-granting, primarily baccalaureate or above
3rd row: Degree-granting, primarily baccalaureate or above
4th row: Degree-granting, primarily baccalaureate or above
5th row: Degree-granting, primarily baccalaureate or above

Value | Count | Frequency (%)
Degree-granting, primarily baccalaureate or above | 1504 | 97.6%
Degree-granting, not primarily baccalaureate or above | 37 | 2.4%
Histogram of lengths of the category
ValueCountFrequency (%)
baccalaureate1541
19.9%
above1541
19.9%
degree-granting1541
19.9%
primarily1541
19.9%
or1541
19.9%
not37
 
0.5%

Most occurring characters

ValueCountFrequency (%)
a10787
14.3%
e9246
12.2%
r9246
12.2%
6201
 
8.2%
g4623
 
6.1%
i4623
 
6.1%
n3119
 
4.1%
t3119
 
4.1%
o3119
 
4.1%
l3082
 
4.1%
Other values (10)18492
24.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter64833
85.7%
Space Separator6201
 
8.2%
Uppercase Letter1541
 
2.0%
Dash Punctuation1541
 
2.0%
Other Punctuation1541
 
2.0%

Most frequent character per category

ValueCountFrequency (%)
a10787
16.6%
e9246
14.3%
r9246
14.3%
g4623
7.1%
i4623
7.1%
n3119
 
4.8%
t3119
 
4.8%
o3119
 
4.8%
l3082
 
4.8%
b3082
 
4.8%
Other values (6)10787
16.6%
ValueCountFrequency (%)
D1541
100.0%
ValueCountFrequency (%)
-1541
100.0%
ValueCountFrequency (%)
,1541
100.0%
ValueCountFrequency (%)
6201
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin66374
87.7%
Common9283
 
12.3%

Most frequent character per script

ValueCountFrequency (%)
a10787
16.3%
e9246
13.9%
r9246
13.9%
g4623
 
7.0%
i4623
 
7.0%
n3119
 
4.7%
t3119
 
4.7%
o3119
 
4.7%
l3082
 
4.6%
b3082
 
4.6%
Other values (7)12328
18.6%
ValueCountFrequency (%)
6201
66.8%
-1541
 
16.6%
,1541
 
16.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII75657
100.0%

Most frequent character per block

ValueCountFrequency (%)
a10787
14.3%
e9246
12.2%
r9246
12.2%
6201
 
8.2%
g4623
 
6.1%
i4623
 
6.1%
n3119
 
4.1%
t3119
 
4.1%
o3119
 
4.1%
l3082
 
4.1%
Other values (10)18492
24.4%

c15basic
Categorical

HIGH CORRELATION

Distinct: 19
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Master^s Colleges & Universities: Larger Programs | 332
Baccalaureate Colleges: Arts & Sciences Focus | 231
Baccalaureate Colleges: Diverse Fields | 199
Master^s Colleges & Universities: Medium Programs | 166
Doctoral Universities: Highest Research Activity | 114
Other values (14) | 499

Length

Max length: 79
Median length: 49
Mean length: 47.63724854
Min length: 15

Characters and Unicode

Total characters: 73409
Distinct characters: 48
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: Master^s Colleges & Universities: Larger Programs
2nd row: Doctoral Universities: Highest Research Activity
3rd row: Doctoral Universities: Higher Research Activity
4th row: Master^s Colleges & Universities: Medium Programs
5th row: Doctoral Universities: Higher Research Activity

Value | Count | Frequency (%)
Master^s Colleges & Universities: Larger Programs | 332 | 21.5%
Baccalaureate Colleges: Arts & Sciences Focus | 231 | 15.0%
Baccalaureate Colleges: Diverse Fields | 199 | 12.9%
Master^s Colleges & Universities: Medium Programs | 166 | 10.8%
Doctoral Universities: Highest Research Activity | 114 | 7.4%
Doctoral Universities: Higher Research Activity | 100 | 6.5%
Special Focus Four-Year: Faith-Related Institutions | 99 | 6.4%
Master^s Colleges & Universities: Small Programs | 98 | 6.4%
Doctoral Universities: Moderate Research Activity | 84 | 5.5%
Special Focus Four-Year: Arts, Music & Design Schools | 41 | 2.7%
Other values (9) | 77 | 5.0%
Histogram of lengths of the category
ValueCountFrequency (%)
colleges1053
 
12.3%
universities894
 
10.4%
876
 
10.2%
master^s596
 
7.0%
programs596
 
7.0%
baccalaureate430
 
5.0%
focus406
 
4.7%
larger332
 
3.9%
activity298
 
3.5%
research298
 
3.5%
Other values (39)2791
32.6%

Most occurring characters

ValueCountFrequency (%)
e8903
12.1%
7029
 
9.6%
s6942
 
9.5%
r5571
 
7.6%
i5068
 
6.9%
a4932
 
6.7%
t4019
 
5.5%
l3675
 
5.0%
o3322
 
4.5%
c3105
 
4.2%
Other values (38)20843
28.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter55037
75.0%
Uppercase Letter7869
 
10.7%
Space Separator7029
 
9.6%
Other Punctuation2503
 
3.4%
Modifier Symbol645
 
0.9%
Dash Punctuation290
 
0.4%
Open Punctuation18
 
< 0.1%
Close Punctuation18
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
e8903
16.2%
s6942
12.6%
r5571
10.1%
i5068
9.2%
a4932
9.0%
t4019
7.3%
l3675
6.7%
o3322
 
6.0%
c3105
 
5.6%
g2328
 
4.2%
Other values (11)7172
13.0%
ValueCountFrequency (%)
C1071
13.6%
M918
11.7%
U894
11.4%
F876
11.1%
A619
7.9%
P612
7.8%
S574
7.3%
D539
6.8%
B485
6.2%
R397
 
5.0%
Other values (8)884
11.2%
ValueCountFrequency (%)
:1521
60.8%
&876
35.0%
,59
 
2.4%
/47
 
1.9%
ValueCountFrequency (%)
^645
100.0%
ValueCountFrequency (%)
7029
100.0%
ValueCountFrequency (%)
-290
100.0%
ValueCountFrequency (%)
(18
100.0%
ValueCountFrequency (%)
)18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin62906
85.7%
Common10503
 
14.3%

Most frequent character per script

ValueCountFrequency (%)
e8903
14.2%
s6942
11.0%
r5571
 
8.9%
i5068
 
8.1%
a4932
 
7.8%
t4019
 
6.4%
l3675
 
5.8%
o3322
 
5.3%
c3105
 
4.9%
g2328
 
3.7%
Other values (29)15041
23.9%
ValueCountFrequency (%)
7029
66.9%
:1521
 
14.5%
&876
 
8.3%
^645
 
6.1%
-290
 
2.8%
,59
 
0.6%
/47
 
0.4%
(18
 
0.2%
)18
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII73409
100.0%

Most frequent character per block

ValueCountFrequency (%)
e8903
12.1%
7029
 
9.6%
s6942
 
9.5%
r5571
 
7.6%
i5068
 
6.9%
a4932
 
6.7%
t4019
 
5.5%
l3675
 
5.0%
o3322
 
4.5%
c3105
 
4.2%
Other values (38)20843
28.4%

instsize
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

1,000 - 4,999 | 714
Under 1,000 | 295
5,000 - 9,999 | 220
10,000 - 19,999 | 163
20,000 and above | 149

Length

Max length: 16
Median length: 13
Mean length: 13.11875406
Min length: 11

Characters and Unicode

Total characters: 20216
Distinct characters: 18
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5,000 - 9,999
2nd row: 20,000 and above
3rd row: 5,000 - 9,999
4th row: 1,000 - 4,999
5th row: 20,000 and above

Value | Count | Frequency (%)
1,000 - 4,999 | 714 | 46.3%
Under 1,000 | 295 | 19.1%
5,000 - 9,999 | 220 | 14.3%
10,000 - 19,999 | 163 | 10.6%
20,000 and above | 149 | 9.7%
Histogram of lengths of the category
ValueCountFrequency (%)
1097
25.3%
1,0001009
23.3%
4,999714
16.5%
under295
 
6.8%
9,999220
 
5.1%
5,000220
 
5.1%
19,999163
 
3.8%
10,000163
 
3.8%
above149
 
3.4%
and149
 
3.4%

Most occurring characters

ValueCountFrequency (%)
04935
24.4%
93674
18.2%
2787
13.8%
,2638
13.0%
11335
 
6.6%
-1097
 
5.4%
4714
 
3.5%
n444
 
2.2%
d444
 
2.2%
e444
 
2.2%
Other values (8)1704
 
8.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number11027
54.5%
Space Separator2787
 
13.8%
Other Punctuation2638
 
13.0%
Lowercase Letter2372
 
11.7%
Dash Punctuation1097
 
5.4%
Uppercase Letter295
 
1.5%

Most frequent character per category

ValueCountFrequency (%)
n444
18.7%
d444
18.7%
e444
18.7%
a298
12.6%
r295
12.4%
b149
 
6.3%
o149
 
6.3%
v149
 
6.3%
ValueCountFrequency (%)
04935
44.8%
93674
33.3%
11335
 
12.1%
4714
 
6.5%
5220
 
2.0%
2149
 
1.4%
ValueCountFrequency (%)
,2638
100.0%
ValueCountFrequency (%)
2787
100.0%
ValueCountFrequency (%)
-1097
100.0%
ValueCountFrequency (%)
U295
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common17549
86.8%
Latin2667
 
13.2%

Most frequent character per script

ValueCountFrequency (%)
04935
28.1%
93674
20.9%
2787
15.9%
,2638
15.0%
11335
 
7.6%
-1097
 
6.3%
4714
 
4.1%
5220
 
1.3%
2149
 
0.8%
ValueCountFrequency (%)
n444
16.6%
d444
16.6%
e444
16.6%
a298
11.2%
U295
11.1%
r295
11.1%
b149
 
5.6%
o149
 
5.6%
v149
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII20216
100.0%

Most frequent character per block

ValueCountFrequency (%)
04935
24.4%
93674
18.2%
2787
13.8%
,2638
13.0%
11335
 
6.6%
-1097
 
5.4%
4714
 
3.5%
n444
 
2.2%
d444
 
2.2%
e444
 
2.2%
Other values (8)1704
 
8.4%

longitud
Real number (ℝ)

Distinct: 1540
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -87.54975128
Minimum: -157.92659
Maximum: 144.8358154
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: -157.92659
5-th percentile: -120.424942
Q1: -93.73083496
median: -84.06147766
Q3: -76.4913559
95-th percentile: -71.45894623
Maximum: 144.8358154
Range: 302.7624054
Interquartile range (IQR): 17.23947906

Descriptive statistics

Standard deviation: 15.92879677
Coefficient of variation (CV): -0.1819399446
Kurtosis: 30.44776535
Mean: -87.54975128
Median Absolute Deviation (MAD): 8.427337646
Skewness: 0.8641968369
Sum: -134914.1719
Variance: 253.7265778
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
-78.63769531 | 2 | 0.1%
-72.24994659 | 1 | 0.1%
-87.58528137 | 1 | 0.1%
-72.58397675 | 1 | 0.1%
-75.1829071 | 1 | 0.1%
-73.67023468 | 1 | 0.1%
-73.8342514 | 1 | 0.1%
-92.08438873 | 1 | 0.1%
-80.24049377 | 1 | 0.1%
-97.44421387 | 1 | 0.1%
Other values (1530) | 1530 | 99.3%

Value | Count | Frequency (%)
-157.92659 | 1 | 0.1%
-157.888382 | 1 | 0.1%
-157.8596497 | 1 | 0.1%
-157.818985 | 1 | 0.1%
-157.8078461 | 1 | 0.1%

Value | Count | Frequency (%)
144.8358154 | 1 | 0.1%
-64.97286224 | 1 | 0.1%
-66.05001068 | 1 | 0.1%
-66.05961609 | 1 | 0.1%
-66.1620636 | 1 | 0.1%

latitude
Real number (ℝ≥0)

UNIQUE

Distinct: 1541
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 38.76873398
Minimum: 13.46504593
Maximum: 64.8575592
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 13.46504593
5-th percentile: 30.09242439
Q1: 35.58656311
median: 40.01423645
Q3: 41.96385956
95-th percentile: 44.9728508
Maximum: 64.8575592
Range: 51.39251328
Interquartile range (IQR): 6.377296448

Descriptive statistics

Standard deviation: 4.909657478
Coefficient of variation (CV): 0.1266396195
Kurtosis: 2.622907639
Mean: 38.76873398
Median Absolute Deviation (MAD): 2.62216568
Skewness: -0.6991415024
Sum: 59742.62109
Variance: 24.10473633
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
42.24998856 | 1 | 0.1%
35.04824829 | 1 | 0.1%
39.32679749 | 1 | 0.1%
42.66752243 | 1 | 0.1%
29.64628983 | 1 | 0.1%
40.29264832 | 1 | 0.1%
33.41772079 | 1 | 0.1%
47.66653061 | 1 | 0.1%
42.13158798 | 1 | 0.1%
40.44843292 | 1 | 0.1%
Other values (1531) | 1531 | 99.4%

Value | Count | Frequency (%)
13.46504593 | 1 | 0.1%
18.00164986 | 1 | 0.1%
18.08353043 | 1 | 0.1%
18.11882019 | 1 | 0.1%
18.2053299 | 1 | 0.1%

Value | Count | Frequency (%)
64.8575592 | 1 | 0.1%
61.19096756 | 1 | 0.1%
61.19016266 | 1 | 0.1%
58.38484573 | 1 | 0.1%
48.73781204 | 1 | 0.1%

roomcap
Real number (ℝ≥0)

Distinct: 1211
Distinct (%): 78.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1921.072031
Minimum: 1
Maximum: 18313
Zeros: 0
Zeros (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 125
Q1: 554
median: 1154
Q3: 2317
95-th percentile: 6597
Maximum: 18313
Range: 18312
Interquartile range (IQR): 1763

Descriptive statistics

Standard deviation: 2338.427154
Coefficient of variation (CV): 1.217251158
Kurtosis: 10.89681744
Mean: 1921.072031
Median Absolute Deviation (MAD): 743
Skewness: 2.897598962
Sum: 2960372
Variance: 5468241.553
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
100 | 10 | 0.6%
400 | 7 | 0.5%
610 | 7 | 0.5%
120 | 7 | 0.5%
150 | 6 | 0.4%
200 | 6 | 0.4%
500 | 6 | 0.4%
1400 | 5 | 0.3%
650 | 5 | 0.3%
1200 | 5 | 0.3%
Other values (1201) | 1477 | 95.8%

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
10 | 1 | 0.1%

Value | Count | Frequency (%)
18313 | 1 | 0.1%
18000 | 1 | 0.1%
17060 | 1 | 0.1%
15917 | 1 | 0.1%
15198 | 1 | 0.1%

roomamt
Real number (ℝ≥0)

ZEROS

Distinct: 888
Distinct (%): 57.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4884.04867
Minimum: 0
Maximum: 16675
Zeros: 309
Zeros (%): 20.1%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 3300
median: 5300
Q3: 6904
95-th percentile: 9510
Maximum: 16675
Range: 16675
Interquartile range (IQR): 3604

Descriptive statistics

Standard deviation: 3106.429487
Coefficient of variation (CV): 0.6360357353
Kurtosis: -0.2552157725
Mean: 4884.04867
Median Absolute Deviation (MAD): 1832
Skewness: -0.08637039418
Sum: 7526319
Variance: 9649904.155
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 309 | 20.1%
5000 | 10 | 0.6%
5200 | 9 | 0.6%
5400 | 8 | 0.5%
5750 | 8 | 0.5%
6180 | 8 | 0.5%
3400 | 7 | 0.5%
6600 | 7 | 0.5%
4300 | 7 | 0.5%
4200 | 7 | 0.5%
Other values (878) | 1161 | 75.3%

Value | Count | Frequency (%)
0 | 309 | 20.1%
460 | 1 | 0.1%
950 | 1 | 0.1%
1000 | 1 | 0.1%
1200 | 1 | 0.1%

Value | Count | Frequency (%)
16675 | 1 | 0.1%
16115 | 1 | 0.1%
15300 | 1 | 0.1%
15000 | 2 | 0.1%
14594 | 1 | 0.1%

boardamt
Real number (ℝ≥0)

ZEROS

Distinct: 758
Distinct (%): 49.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3490.780013
Minimum: 0
Maximum: 8965
Zeros: 376
Zeros (%): 24.4%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1950
median: 4064
Q3: 5080
95-th percentile: 6374
Maximum: 8965
Range: 8965
Interquartile range (IQR): 3130

Descriptive statistics

Standard deviation: 2224.323645
Coefficient of variation (CV): 0.637199605
Kurtosis: -0.9403457278
Mean: 3490.780013
Median Absolute Deviation (MAD): 1148
Skewness: -0.523572115
Sum: 5379292
Variance: 4947615.679
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 376 | 24.4%
5100 | 13 | 0.8%
3000 | 10 | 0.6%
4000 | 9 | 0.6%
4600 | 8 | 0.5%
4200 | 8 | 0.5%
3900 | 7 | 0.5%
3600 | 7 | 0.5%
5000 | 7 | 0.5%
4800 | 7 | 0.5%
Other values (748) | 1089 | 70.7%

Value | Count | Frequency (%)
0 | 376 | 24.4%
490 | 1 | 0.1%
600 | 1 | 0.1%
700 | 1 | 0.1%
840 | 1 | 0.1%

Value | Count | Frequency (%)
8965 | 1 | 0.1%
8480 | 1 | 0.1%
8330 | 1 | 0.1%
8210 | 1 | 0.1%
8038 | 1 | 0.1%

applfeeu
Real number (ℝ≥0)

ZEROS

Distinct: 33
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.87475665
Minimum: 0
Maximum: 200
Zeros: 446
Zeros (%): 28.9%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 30
Q3: 50
95-th percentile: 75
Maximum: 200
Range: 200
Interquartile range (IQR): 50

Descriptive statistics

Standard deviation: 25.68658283
Coefficient of variation (CV): 0.8058597314
Kurtosis: 0.9462515356
Mean: 31.87475665
Median Absolute Deviation (MAD): 20
Skewness: 0.5039590672
Sum: 49119
Variance: 659.8005377
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=33)
Value | Count | Frequency (%)
0 | 446 | 28.9%
50 | 234 | 15.2%
25 | 159 | 10.3%
40 | 111 | 7.2%
30 | 107 | 6.9%
35 | 103 | 6.7%
65 | 54 | 3.5%
60 | 54 | 3.5%
20 | 50 | 3.2%
75 | 46 | 3.0%
Other values (23) | 177 | 11.5%

Value | Count | Frequency (%)
0 | 446 | 28.9%
10 | 3 | 0.2%
15 | 6 | 0.4%
20 | 50 | 3.2%
25 | 159 | 10.3%

Value | Count | Frequency (%)
200 | 1 | 0.1%
150 | 2 | 0.1%
125 | 1 | 0.1%
120 | 2 | 0.1%
110 | 3 | 0.2%

applcn
Real number (ℝ≥0)

Distinct: 1408
Distinct (%): 91.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6764.367943
Minimum: 3
Maximum: 102225
Zeros: 0
Zeros (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 3
5-th percentile: 70
Q1: 1256
median: 3211
Q3: 7367
95-th percentile: 27679
Maximum: 102225
Range: 102222
Interquartile range (IQR): 6111

Descriptive statistics

Standard deviation: 10403.92701
Coefficient of variation (CV): 1.538048654
Kurtosis: 18.40607631
Mean: 6764.367943
Median Absolute Deviation (MAD): 2413
Skewness: 3.651581663
Sum: 10423891
Variance: 108241697.2
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count    Frequency (%)
5                   4        0.3%
1693                4        0.3%
6                   3        0.2%
30                  3        0.2%
388                 3        0.2%
2231                3        0.2%
873                 3        0.2%
1843                3        0.2%
746                 3        0.2%
25                  3        0.2%
Other values (1398) 1509     97.9%

Value               Count    Frequency (%)
3                   3        0.2%
5                   4        0.3%
6                   3        0.2%
7                   2        0.1%
8                   2        0.1%

Value               Count    Frequency (%)
102225              1        0.1%
88446               1        0.1%
85092               1        0.1%
85044               1        0.1%
81824               1        0.1%

admssn
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 1358
Distinct (%): 88.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3787.340039
Minimum: 2
Maximum: 31878
Zeros: 0
Zeros (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 2
5-th percentile: 48
Q1: 824
median: 2037
Q3: 4600
95-th percentile: 14876
Maximum: 31878
Range: 31876
Interquartile range (IQR): 3776

Descriptive statistics

Standard deviation: 4923.005565
Coefficient of variation (CV): 1.299858348
Kurtosis: 7.281832763
Mean: 3787.340039
Median Absolute Deviation (MAD): 1515
Skewness: 2.503095216
Sum: 5836291
Variance: 24235983.79
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count    Frequency (%)
6                   4        0.3%
18                  4        0.3%
3                   4        0.3%
7                   4        0.3%
15                  4        0.3%
11                  3        0.2%
807                 3        0.2%
28                  3        0.2%
863                 3        0.2%
1375                3        0.2%
Other values (1348) 1506     97.7%

Value               Count    Frequency (%)
2                   1        0.1%
3                   4        0.3%
4                   1        0.1%
5                   2        0.1%
6                   4        0.3%

Value               Count    Frequency (%)
31878               1        0.1%
31063               1        0.1%
30762               1        0.1%
30061               1        0.1%
29812               1        0.1%

enrlft
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 1017
Distinct (%): 66.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 979.3361454
Minimum: 0
Maximum: 10099
Zeros: 1
Zeros (%): 0.1%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 28
Q1: 218
median: 465
Q3: 1122
95-th percentile: 4040
Maximum: 10099
Range: 10099
Interquartile range (IQR): 904

Descriptive statistics

Standard deviation: 1340.264785
Coefficient of variation (CV): 1.368544183
Kurtosis: 8.059855814
Mean: 979.3361454
Median Absolute Deviation (MAD): 312
Skewness: 2.653629428
Sum: 1509157
Variance: 1796309.695
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count    Frequency (%)
9                   7        0.5%
276                 6        0.4%
13                  6        0.4%
204                 6        0.4%
3                   6        0.4%
281                 6        0.4%
341                 5        0.3%
2                   5        0.3%
171                 5        0.3%
164                 5        0.3%
Other values (1007) 1484     96.3%

Value               Count    Frequency (%)
0                   1        0.1%
2                   5        0.3%
3                   6        0.4%
4                   3        0.2%
5                   1        0.1%

Value               Count    Frequency (%)
10099               1        0.1%
8238                1        0.1%
8140                1        0.1%
7975                1        0.1%
7832                1        0.1%

enrlt
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 1030
Distinct (%): 66.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1004.758598
Minimum: 2
Maximum: 11639
Zeros: 0
Zeros (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 2
5-th percentile: 30
Q1: 220
median: 471
Q3: 1145
95-th percentile: 4153
Maximum: 11639
Range: 11637
Interquartile range (IQR): 925

Descriptive statistics

Standard deviation: 1385.421818
Coefficient of variation (CV): 1.378860375
Kurtosis: 8.565152258
Mean: 1004.758598
Median Absolute Deviation (MAD): 317
Skewness: 2.693758064
Sum: 1548333
Variance: 1919393.613
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count    Frequency (%)
279                 6        0.4%
203                 6        0.4%
9                   6        0.4%
13                  6        0.4%
171                 6        0.4%
183                 5        0.3%
433                 5        0.3%
197                 5        0.3%
267                 5        0.3%
550                 5        0.3%
Other values (1020) 1486     96.4%

Value               Count    Frequency (%)
2                   3        0.2%
3                   5        0.3%
4                   4        0.3%
5                   2        0.1%
6                   2        0.1%

Value               Count    Frequency (%)
11639               1        0.1%
8381                1        0.1%
8366                1        0.1%
8001                1        0.1%
7874                1        0.1%

accept
Real number (ℝ≥0)

Distinct: 1496
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.6672388599
Minimum: 0.01852300829
Maximum: 1
Zeros: 0
Zeros (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 0.01852300829
5-th percentile: 0.27330197
Q1: 0.5406758448
median: 0.6911189358
Q3: 0.8192371476
95-th percentile: 0.9634703196
Maximum: 1
Range: 0.9814769917
Interquartile range (IQR): 0.2785613028

Descriptive statistics

Standard deviation: 0.20701538
Coefficient of variation (CV): 0.3102567797
Kurtosis: 0.148854929
Mean: 0.6672388599
Median Absolute Deviation (MAD): 0.136942824
Skewness: -0.6654676229
Sum: 1028.215083
Variance: 0.04285536754
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count    Frequency (%)
1                   28       1.8%
0.8                 3        0.2%
0.8333333333        3        0.2%
0.8571428571        3        0.2%
0.9444444444        2        0.1%
0.875               2        0.1%
0.9333333333        2        0.1%
0.9230769231        2        0.1%
0.7435897436        2        0.1%
0.8666666667        2        0.1%
Other values (1486) 1492     96.8%

Value               Count    Frequency (%)
0.01852300829       1        0.1%
0.03303964758       1        0.1%
0.04730787557       1        0.1%
0.05156178808       1        0.1%
0.05920813658       1        0.1%

Value               Count    Frequency (%)
1                   28       1.8%
0.9962962963        1        0.1%
0.9960227273        1        0.1%
0.9958758411        1        0.1%
0.9951690821        1        0.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that for a linear relationship the slope of the line, that is, its angle to the x-axis, does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
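As a minimal sketch of the calculation described above, the following pure-Python function divides the covariance by the product of the standard deviations. The function name `pearson_r` and the toy data are illustrative assumptions and are not taken from the profiled dataset or from pandas-profiling itself.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Hypothetical toy data, chosen only for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

r = pearson_r(x, y)
# Shifting and positively rescaling x should leave r unchanged.
r_rescaled = pearson_r([10 * a + 3 for a in x], y)
```

Because the same divisor is used for the covariance and for both variances, the choice between population and sample normalization cancels out and does not change r.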

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
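The rank-based calculation can be sketched as follows: convert each variable to ranks, then apply the covariance-over-standard-deviations formula to the ranks. The helper names and the toy data are assumptions made for illustration, and giving tied values their average rank is one common convention rather than something the report specifies.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical data: y is a monotonic but nonlinear function of x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)   # monotonic relationship, so rho is 1 up to rounding
r = pearson_r(x, y)        # linear measure, noticeably below 1 here
```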

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the difference between the number of concordant pairs and the number of discordant pairs, divided by the total number of pairs.
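A minimal sketch of the pair-counting formula as stated above is shown below. This is the variant often called tau-a; library implementations (for example `scipy.stats.kendalltau`, which defaults to tau-b) adjust the denominator for ties, so results can differ when the data contain ties. The function name and the toy data are illustrative assumptions.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs; ties count as neither."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Hypothetical data: 10 pairs in total, 8 concordant and 2 discordant.
x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]

tau = kendall_tau_a(x, y)
```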

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
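The following is a sketch of a bias-corrected Cramér's V along the lines of Bergsma's 2013 correction, written in pure Python. It is an illustration under stated assumptions, not the code pandas-profiling runs: the function name, the handling of a constant column (returning 0), and the toy labels are all hypothetical.

```python
import math
from collections import Counter

def cramers_v_corrected(a, b):
    """Bias-corrected Cramér's V for two equal-length sequences of labels."""
    n = len(a)
    joint = Counter(zip(a, b))
    row_totals = Counter(a)
    col_totals = Counter(b)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for row_label, row_count in row_totals.items():
        for col_label, col_count in col_totals.items():
            expected = row_count * col_count / n
            observed = joint.get((row_label, col_label), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    # Correction terms: shrink phi^2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0  # assumption: a constant column carries no association
    return math.sqrt(phi2_corr / denom)

# Hypothetical labels, for illustration only.
control = ["Public"] * 4 + ["Private"] * 4
sector = ["A"] * 4 + ["B"] * 4                           # fully determined by control
locale = ["City", "City", "Suburb", "Suburb"] * 2        # independent of control

v_perfect = cramers_v_corrected(control, sector)
v_independent = cramers_v_corrected(control, locale)
```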

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.

Sample

First rows

(index) | unitid | instnm | stabbr | sector | iclevel | control | locale | instcat | c15basic | instsize | longitud | latitude | roomcap | roomamt | boardamt | applfeeu | applcn | admssn | enrlft | enrlt | accept
0 | 100654 | Alabama A & M University | AL | Public, 4-year or above | Four or more years | Public | City: Midsize | Degree-granting, primarily baccalaureate or above | Master^s Colleges & Universities: Larger Programs | 5,000 - 9,999 | -86.568504 | 34.783367 | 2614.0 | 5400.0 | 3620.0 | 30.0 | 8610.0 | 7772.0 | 1288.0 | 1294.0 | 0.902671
1 | 100663 | University of Alabama at Birmingham | AL | Public, 4-year or above | Four or more years | Public | City: Midsize | Degree-granting, primarily baccalaureate or above | Doctoral Universities: Highest Research Activity | 20,000 and above | -86.799347 | 33.505695 | 2785.0 | 7532.0 | 4150.0 | 30.0 | 7555.0 | 6936.0 | 2228.0 | 2299.0 | 0.918068
2 | 100706 | University of Alabama in Huntsville | AL | Public, 4-year or above | Four or more years | Public | City: Midsize | Degree-granting, primarily baccalaureate or above | Doctoral Universities: Higher Research Activity | 5,000 - 9,999 | -86.640450 | 34.724556 | 1652.0 | 0.0 | 0.0 | 30.0 | 4454.0 | 3618.0 | 1341.0 | 1352.0 | 0.812304
3 | 100724 | Alabama State University | AL | Public, 4-year or above | Four or more years | Public | City: Midsize | Degree-granting, primarily baccalaureate or above | Master^s Colleges & Universities: Medium Programs | 1,000 - 4,999 | -86.295677 | 32.364319 | 2491.0 | 3346.0 | 2076.0 | 25.0 | 6842.0 | 6696.0 | 951.0 | 967.0 | 0.978661
4 | 100751 | The University of Alabama | AL | Public, 4-year or above | Four or more years | Public | City: Small | Degree-granting, primarily baccalaureate or above | Doctoral Universities: Higher Research Activity | 20,000 and above | -87.545975 | 33.211876 | 8449.0 | 5750.0 | 3674.0 | 40.0 | 38129.0 | 20321.0 | 7385.0 | 7407.0 | 0.532954
5 | 100830 | Auburn University at Montgomery | AL | Public, 4-year or above | Four or more years | Public | City: Midsize | Degree-granting, primarily baccalaureate or above | Master^s Colleges & Universities: Larger Programs | 1,000 - 4,999 | -86.177544 | 32.367359 | 1200.0 | 4580.0 | 2400.0 | 0.0 | 2454.0 | 2022.0 | 627.0 | 652.0 | 0.823961
6 | 100858 | Auburn University | AL | Public, 4-year or above | Four or more years | Public | City: Small | Degree-granting, primarily baccalaureate or above | Doctoral Universities: Higher Research Activity | 20,000 and above | -85.488258 | 32.599377 | 4737.0 | 7860.0 | 5472.0 | 50.0 | 18072.0 | 15168.0 | 4771.0 | 4836.0 | 0.839309
7 | 100937 | Birmingham Southern College | AL | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Midsize | Degree-granting, primarily baccalaureate or above | Baccalaureate Colleges: Arts & Sciences Focus | 1,000 - 4,999 | -86.850555 | 33.513775 | 1597.0 | 0.0 | 0.0 | 50.0 | 2559.0 | 1583.0 | 349.0 | 349.0 | 0.618601
8 | 101189 | Faulkner University | AL | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Midsize | Degree-granting, primarily baccalaureate or above | Master^s Colleges & Universities: Small Programs | 1,000 - 4,999 | -86.216408 | 32.384182 | 704.0 | 3500.0 | 3900.0 | 25.0 | 2335.0 | 1191.0 | 311.0 | 333.0 | 0.510064
9 | 101435 | Huntingdon College | AL | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Midsize | Degree-granting, primarily baccalaureate or above | Baccalaureate Colleges: Diverse Fields | 1,000 - 4,999 | -86.284363 | 32.351032 | 629.0 | 0.0 | 0.0 | 0.0 | 2074.0 | 1161.0 | 294.0 | 294.0 | 0.559788

Last rows

(index) | unitid | instnm | stabbr | sector | iclevel | control | locale | instcat | c15basic | instsize | longitud | latitude | roomcap | roomamt | boardamt | applfeeu | applcn | admssn | enrlft | enrlt | accept
1531 | 488314 | Beth Medrash of Asbury Park | NJ | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | Suburb: Large | Degree-granting, primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -74.210815 | 40.043705 | 60.0 | 0.0 | 0.0 | 0.0 | 30.0 | 26.0 | 23.0 | 23.0 | 0.866667
1532 | 488350 | Yeshiva Gedolah Shaarei Shmuel | NJ | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Small | Degree-granting, primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -74.197594 | 40.091000 | 90.0 | 0.0 | 0.0 | 0.0 | 100.0 | 36.0 | 27.0 | 27.0 | 0.360000
1533 | 488785 | University of Saint Katherine | CA | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | Suburb: Large | Degree-granting, primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -117.196548 | 33.151283 | 55.0 | 9000.0 | 0.0 | 20.0 | 76.0 | 28.0 | 16.0 | 16.0 | 0.368421
1534 | 488819 | The Colburn Conservatory of Music | CA | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Large | Degree-granting, primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -118.249840 | 34.054070 | 150.0 | 12360.0 | 5879.0 | 120.0 | 287.0 | 18.0 | 18.0 | 18.0 | 0.062718
1535 | 489937 | Piedmont International University | NC | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Midsize | Degree-granting, primarily baccalaureate or above | Special Focus Four-Year: Faith-Related Institutions | Under 1,000 | -80.250153 | 36.087963 | 185.0 | 0.0 | 0.0 | 39.0 | 256.0 | 93.0 | 64.0 | 65.0 | 0.363281
1536 | 490319 | Yeshiva Bais Aharon | NJ | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Small | Degree-granting, primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -74.203094 | 40.101002 | 50.0 | 1400.0 | 700.0 | 0.0 | 23.0 | 18.0 | 13.0 | 13.0 | 0.782609
1537 | 490504 | Yeshiva Ohr Naftoli | NY | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | Suburb: Large | Degree-granting, not primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -74.046288 | 41.456985 | 100.0 | 0.0 | 0.0 | 100.0 | 15.0 | 15.0 | 11.0 | 11.0 | 1.000000
1538 | 490513 | Bais Medrash Mayan Hatorah | NJ | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Small | Degree-granting, primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -74.204330 | 40.107414 | 69.0 | 0.0 | 0.0 | 0.0 | 30.0 | 25.0 | 21.0 | 21.0 | 0.833333
1539 | 490805 | Purdue University Northwest | IN | Public, 4-year or above | Four or more years | Public | Suburb: Large | Degree-granting, primarily baccalaureate or above | Master^s Colleges & Universities: Larger Programs | 10,000 - 19,999 | -87.474236 | 41.584324 | 744.0 | 5595.0 | 0.0 | 25.0 | 4136.0 | 1434.0 | 1070.0 | 1125.0 | 0.346712
1540 | 491057 | Yeshiva Kollel Tifereth Elizer | NY | Private not-for-profit, 4-year or above | Four or more years | Private not-for-profit | City: Large | Degree-granting, primarily baccalaureate or above | Not applicable, not in Carnegie universe (not accredited or nondegree-granting) | Under 1,000 | -73.992172 | 40.637096 | 28.0 | 2000.0 | 3000.0 | 0.0 | 15.0 | 14.0 | 12.0 | 12.0 | 0.933333